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ABSTRACT 

We present a new method to detect and quantify mass segregation in star clusters. 
It compares the minimum spanning tree (MST) of massive stars with that of random 
stars. If mass segregation is present, the MST length of the most massive stars will 
be shorter than that of random stars. This difference can be quantified (with an 
associated significance) to measure the degree of mass segregation. We test the method 
on simulated clusters in both 2D and 3D and show that the method works as expected. 
We apply the method to the Orion Nebula Cluster (ONC) and show that the method 
is able to detect the mass segregation in the Trapezium with a 'mass segregation ratio' 
A-msr = 8.0 ±3.5 (where Amsr = 1 is no mass segregation) down to 16 M Q , and also 
that the ONC is mass segregated at a lower level (~ 2.0 ± 0.5) down to 5 M . Below 
5 M Q we find no evidence for any further mass segregation in the ONC. 
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1 INTRODUCTION 

It is thought that many young star clusters show 'mass seg- 
regation' - where the massive stars are more centrally con- 
centrated than lower-mass stars (see Section 2 for a review 
of current methods for detecting mass segregation). 

There are two possible origins of mass segregation in 
star clusters. It may be dynamical, in that mass segrega- 
tion evolves within an initial ly non-mass segregated clus- 
ter d ue to dynamical effects ( Chandrasckhar 1 19421 ; ISpitzerl 
1 19691 ). Or it may be primordial, where mass segregation is an 
outcome of the star fo rmation process dMurrav fc Linlll996l; 
iBonnell fc Batell2006l ). Information about the presence and 
degree of mass segregation thus provides strong constraints 
on star cluster formation and evolution. However, there are 
currently no good methods by which mass segregation can 
be determined and quantified absolutely. This seriously lim- 
its the interpretation of observations of mass segregation, 
and especially our ability to compare different clusters. 

In this paper we present a new method, based on the 
minimum spanning tree, which allows us to quantify the de- 
gree of mass segregation in a cluster, and examine how mass 
segregation changes with stellar mass. In Section 2 we review 
the limitations of current methods of determining mass seg- 



regation. In Sections 3 and 4 we introduce our new method 
and test it against a number of artificial clusters and the 
ONC. In Section 5 we discuss the method's limitations and 
some ways in which it might be improved, and we summarise 
our results in Section 6. 



2 MASS SEGREGATION 
2.1 What is mass segregation? 

In this paper we will take as a working definition of mass 
segregation that the most massive stars are not distributed 
in the same way as other stars. In particular, that they have 
a more concentrated distribution. 

This is not the only possible definition of mass segre- 
gation. It would be possible to define mass segregation as 
a significant difference in the mass function^ (MFs) at dif- 
ferent positions in a cluster. Indeed, this is often the way 
mass segregation is currently defined (see Sectionf23J. How- 
ever, we feel such a definition is problematic as it raises the 
question of how many stars need to be included in order to 
properly sample the MF and therefore tell if two MFs are 
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We deliberately distinguish between the present-day mass func- 
tion and the initial mass function. 



2 R. J. Allison et al. 



different. Also, how much do the mass ranges sampled need 
to overlap in order to differentiate MF slopes? Both of these 
questions are difficult to answer without assuming a MF a 
priori. 



that the cluster centre must be determined; it is generally 
chosen as where the luminosity is highest - i.e. where the 
massive stars are. 



2.2 Current methods for detecting mass 
segregation 

The current methods can be loosely separated into two 
groups, those that involve fitting a density profile and char- 
acteristic radii to various mass ranges; and those that trace 
the variation of the MF with radius. 

None of the current methods to examine mass segrega- 
tion give a model independent, quantitative measure of the 
amount of mass segregation in a cluster (and most are not 
quantitative at all). The current methods of determining the 
presence of mass segregation also rely on a determination of 
the centre of the cluster, which in itself may be very difficult 
to define. 

Density profiles are usually fitted with a model profile 
(e.g. a King (1966) model) or as a cumulative distribution 
function for different mass 'bins'. This allows the variation in 
distribution to be easily seen, and the 'degree' of mass seg- 
regation can be measured by the change of the slope of the 
distribution function. From these distributions characteris- 
tic radii for each mass bin (e.g. core radius) c an be found. 
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2008). However, the ac- 



curacy of this m e thod has been brought into question by 
iGouliermis et all ((2004), who show that this method is 
highly dependent on the number of mass bins, the size of 
the bins, and on the models used to fit the density pro- 
file. |c onvcrsc fc Stahlerl | 2008) use a method which is sim- 
ilar to comparing cumulative distributions. They study the 
amount of mass inside a projected radius and obtain a quan- 
tifiable measure of the degree of mass segregation from the 
area between this curve and a curve for a cluster with no 
mass segregation. However, thi s relies on having a mode l for 
a non-mass segregated cluster. IGouliermis et all l|2008h use 
the variation of the 'Spitzer radius' (the Spitzer radius is 
the r.m.s. distance of stars in a cluster around the centre 
of mass) with luminosity to determine the presence of mass 
segregation. 

A variation of the MF with radius will show the pres- 
ence of mass seg regation as a shallowing of the slope of the 
MF with radius rtde Griis et al. 2002l|3Q; IGouliermis et al.l 
20041 : ISabbi et al.ll2008l : iKumar et al.ll2008l : lHaravama et al.1 
20081 ). Mass functions are calculated for various annuli, such 



that in a mass segregated cluster the radii closest to the cen- 
tre of the cluster will contain the most massive stars, and 
so they will have shallower slopes than those at larger radii. 
This method also suffers from a strong depe ndence on the 
choic e of annuli and mass bin s for the MF ()Ascenso et al.l 
|2008T ). IGouliermis et al.1 l|2004h find that a unique set of an- 
nuli to use for comparison is not simple to choose because of 
the variation of the shape and slope of the MF as the choice 
of bin size changes for individual clusters. A particular prob- 
lem is that low-mass stars are often difficult to detect in the 
central regions leading to relatively small regions of overlap 
in the MFs at different radii (which are the slopes to be 
compared) . A further complication is introduced by the fact 



3 MINIMUM SPANNING TREE METHOD 

In this section we detail our new method for defining and 
quantifying mass segregation using a minimum spanning 
tree (MST). 



3.1 The MST 

The MST of a sample of points is the path connecting all 
points in a sample with the shortest po ssible pathl ength, 
which contains no closed loops (see, e.g. |Priml ll957). The 
length of the MST is unique, but the exact path might differ 
in some special circumstances. For example, an equilateral 
triangle of stars named 1,2, and 3 has two possible MSTs - 
one connecting stars 1 & 2 and 2 & 3, and one connecting 
stars 1 & 2 and 1 & 3. But the lengths of both MSTs are 

identical. 

We have used the al g orithm of iPriml l|l957t ) (see also 
ICartwright fc Whitworthl Hjol)) to construct our MSTs. 
We construct an ordered list of the separations between all 
possible pairs of stars. Stars are then connected together in 
'nodes' starting with the shortest separations and proceed- 
ing through the list in order of increasing separation, form- 
ing new nodes as long as the formation of the node does not 
create a closed loop. 



3.2 Quantifying mass segregation 

Following our definition of mass segregation (see Section l2~l"1) 
we can detect and quantify mass segregation by comparing 
the typical MST of cluster stars with the MST of the most 
massive stars. 

The method is based on finding the MST of the N most 
massive stars and comparing this to the MST of sets of N 
random stars in the cluster. If the length of the MST of the 
most massive stars is significantly shorter than the average 
length of the MSTs of the random stars, then the massive 
stars have a different, and more concentrated, distribution - 
hence the cluster is mass segregated. The ratio of the average 
random MST length to the massive star MST length gives a 
quantitative measure of the degree of mass segregation with 
an associated error (i.e. how likely it is that the massive star 
MST is the same as that of a random set of stars). 

The algorithm proceeds in the following way: 

1. Determine the length of the MST of the JVmst most mas- 
sive stars; /massive- The MST of a subset of the Nmst most 
massive (or what ever subset is of interest) stars is con- 
structed. This MST has length /massive- 

2. Determine the average length of the MST of sets of Nmst 
random stars; (/norm)- Sets of Nmst random stars are con- 
structed, and the average length, (Z n0 rm) of their MSTs is 
determined. There is an dispersion associated with the av- 
erage length of the random MSTs which is roughly gaussian 
and so can be quantified by the standard deviation of the 

lengths (/norm) ± O"norm- 

Numerical experiments have shown that 50 random sets 
is sufficient to obtain a good estimate of the errors. However 
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using hundreds of random sets tends to result in smoother 
trends. We suggest using 500 - 1000 random sets for low 
Nmst and at least 50 for high Nmst. 

Note that the error cr norm on (/norm) is always large for 
small -/Vmst due to stochastic effects when randomly choos- 
ing a small number of stars (see Section T4.2p . 
3. Determine with what statistical significance i maB sive differs 
from (/norm)- We define the 'mass segregation ratio' (Amsr) 
as the ratio between the average random path length and 
that of the massive stars: 

. (/norm) , ^norm / 1 \ 

Amsr = -. ± -, (1) 

'massive 'massive 

where an Amsr of ~ 1 shows that the massive stars are 
distributed in the same way as all other stars, an Amsr 
significantly > 1 indicates mass segregation, and an Amsr 
significantly < 1 indicates inverse-mass segregation (i.e. the 
massive stars are more widely spaced than other stars). The 
more (significantly) different the Amsr is from unity, the 
more extreme is the degree of (inverse-)mass segregation. 
4- Repeat the above steps for different values of -/Vmst to 
determine at what masses the cluster is segregated, and to 
what degree at each mass. Clearly, the choice of -/Vmst is 
arbitrary, so the Amsr needs to be determined for many 
values of -/Vmst to gain information on how the degree of 
mass segregation changes with mass. 

This method has three major advantages over current 
methods. Firstly, it gives a quantitative measure of mass seg- 
regation with an associated significance, allowing different 
clusters to be directly compared. Secondly, it does not rely 
on defining a cluster centre or any special location in a clus- 
ter (allowing it to be applied to highly substructured regions 
as well). Thirdly, it is applicable not just to the most mas- 
sive stars, but to any subset of stars (or brown dwarfs) that 
one might think has a different distribution to the 'norm'. 

It should be noted that whilst we are applying our 
method to simulated data and therefore know the mass of 
each star, an absolute determination of the mass is not re- 
quired, just a measure of the relative masses (i.e. luminos- 

ity). 

4 THE METHOD IN ACTION 

4.1 A non-mass segregated cluster 

We tested the m ethod on a large sample of Plummer spheres 
l|Plummerl 1 1 9 1 if ) consisting of 1000 single sta rs with masse s 
sampled from a Kroupa initial mass function (lKroupall2002l) . 
The masses are assigned randomly to stars so these clusters 
should not be mass segregated (except by chance). 

Tables [T] and [2] show the fractions of random clusters 
that are found at la significance to be either inverse-mass 
segregated (Inverse-MS: Amsr < 1), mass segregated (MS: 
Amsr > 1) or that showed no mass segregation (No MS: 
Amsr ~ 1) for various numbers of members in the MST 
(-/Vmst). Table [1] uses the 3D positions of the stars, Table [2] 
is for a 2D projection as would be observed. In both the 2D 
and 3D clusters we find that false positives and negatives 
(i.e. Amsr > lo~ from unity) occur with equal frequency 
around l/6th of the time - exactly what would be expected 
if the error is Gaussian. 







/Vmst 






5 


10 


100 


200 


Inverse-MS 


14.1% 


15.5 


17.1 


15.2 


MS 


12.6% 


14.1 


17.0 


17.2 


No MS 


73.3% 


70.4 


65.9 


67.6 



Table 1. The percentage of Plummer spheres which show evi- 
dence of inverse-mass segregation (Inverse-MS), mass segregation 
(MS), and no mass segregation (No MS) for various numbers of 
members in the MST, -/Vmsti i n 3 dimensions. 







/Vmst 






5 


10 


100 


200 


Inverse-MS 


13.1% 


14.9 


17.2 


17.1 


MS 


10.4% 


12.2 


17.0 


19.1 


No MS 


76.5% 


72.9 


65.8 


63.8 



Table 2. As Table[T]but for clusters projected into 2 dimensions 
as would be observed. 



4.2 Mass segregated clusters 

We then test the method with mass segregated clusters. 
These are created by first producing Plummer spheres of 
1000 stars in the same way as before. The positions of the 
most massive x per cent of the stars are then swapped with 
randomly chosen stars positioned inside the x per cent num- 
ber radius, so only the most massive stars are inside this 
inner radius. The routine creates clusters in which the most 
massive stars are centrally concentrated, and so are mass 
segregated. This is not necessarily a realistic configuration 
as Nature might produce, but it is one in which we can easily 
control the degree of mass segregation. 

Each cluster contains 1000 stars. Therefore, a 5 per cent 
MS places the 50 most massive stars within the inner 5 per 
cent number radius. For 10 per cent MS, it is the 100 most 
massive stars, and so on. We vary the radii in which the most 
massive stars are placed from the 5 per cent number radius 
to the 80 per cent number radius. Note that at large values 
the cluster is more like one in which the low-mass stars have 
been concentrated at the outskirts rather than one in which 
the high-mass stars are centrally concentrated. 

Tables [3] and [4] show the percentage of clusters which 
the method finds to have mass segregation at > la signifi- 
cance for different numbers of stars in the MST (-/Vmst). In 
brackets after the percentage is the typical significance of the 
mass segregation. As before, Table [3] uses the 3D positions 
of the stars, while in Table [4] a 2D projection is used. 

It is clear from Tables [3] and [4] that the ability of the 
method to detect mass segregation depends on a combina- 
tion of the level of mass segregation and the number of stars 
in the MST. In the first column, when A^mst = 5 the method 
is able to detect mass segregation at roughly 2a significance 
when the %MS is low. As it only uses the 5 most massive 
stars to look for mass segregation, it is not good at finding 
mass segregation when the %MS involves hundreds of stars. 

In contrast, when A^mst = 500 (i.e. half of the stars 
in the cluster), the method is poor at spotting when the 
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%MS 






-Nmst 








5 


10 


100 


200 


500 


5 


100 (2) 


100 (3) 


100 (3) 


90 (2) 


40 (1) 


10 


100 (2) 


100 (3) 


100 (7) 


100 (4) 


70 (1) 


20 


100 (2) 


100 (3) 


100 (7) 


100 (10) 


100 (3) 


50 


80 (2) 


100 (2) 


100 (5) 


100 (8) 


100 (15) 


80 


30 (1) 


80 (1) 


100 (4) 


100 (6) 


100 (10) 



Table 3. The frequency of a positive detection of mass segrega- 
tion for various -/Vmst and various degrees of mass segregation 
(%MS) in a 3D cluster. The numbers in parentheses denote the 
typical significance (in sigma) at which mass segregation is de- 
tected. 



%MS 






-/Vmst 








5 


10 


100 


200 


500 


5 


100 (2) 


100 (2) 


100 (3) 


70 (2) 


30 (2) 


10 


100 (2) 


100 (2) 


100 (6) 


100 (2) 


50 (2) 


20 


100 (2) 


100 (2) 


100 (6) 


100 (9) 


80 (2) 


50 


70 (1) 


100 (2) 


100 (5) 


100 (7) 


100 (12) 


80 


(-) 


60 (1) 


100 (3) 


100 (6) 


100 (10) 



Table 4. As Table [3] but for a 2D projection as would be ob- 
served. 
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Figure 1. The evolution of Amsr with -/V^ST f° r a Plummer 
sphere with an initial 5 percent MS. The dashed line indicates 
Amsr=1 i-e. No MS. 



mass segregation only involves a few stars. However, it be- 
comes extremely good when finding mass segregation involv- 
ing many hundreds of stars. 

Looking across Tables [3] and [3] the highest significances 
for finding mass segregation (up to 10 or 15a results) occur 
when JVmst is equal to the number of stars that have been 
mass segregated. The best results are for large -Vmst as the 
variance in (Z n0 rm) is smallest when -/Vmst is large. Note 
that, as mentioned above, when -/Vmst is small the variance 
in (/norm) is large due to stochastic effects. 

Therefore, it is vital to vary -/Vmst in order to examine 
mass segregation. However, this is an important diagnostic 
tool, as variations of Amsr and its significance with -/Vmst 
contain information about how, and down to which mass 
mass segregation is found. 

In Fig.[T]we show the variation of the Amsr with -/Vmst 
for a 5 per cent level of mass segregation (i.e. the 50 most 
massive stars are segregated). The Amsr is around 9 for 
-/Vmst < 50, and then drops sharply toward unity when 
-/Vmst > 50. This shows two features of mass segregation 
that can be extracted using this method. 

Firstly, it shows that only the 50 most massive stars are 
mass segregated (as described above) . However, it also shows 
that the 50 most massive stars are equally mass segregated. 
That is, the method places the 50 most massive stars at the 
centre of the cluster but does not order them in any specific 
way within the centre. 

To illustrate this we show in Fig. [2] the variation of the 
Amsr with -/Vmst for a cluster in which the 10 most massive 
stars are placed in the inner 1 per cent radius, and the 11th 
to 70th most massive stars in the 1-7 per cent radius. The 
method clearly detects that there are two levels of mass 
segregation - one involving the 10 most massive stars, and 
one which involves the 70 most massive stars. 



30 



25 - 








20 40 60 80 100 

Nmst 

Figure 2. The evolution of Amsr with Wmst for a Plummer 
sphere with an initial 1 per cent and 7 per cent MS. The dashed 
line indicates AjvfSi?=l i-e. No MS. 

4.3 A complex mass segregated cluster 

We also apply the method to a more realistic and complex 
cluster. In Fig.0we show a complex 1000-body clustefl The 

2 Th e cluster has been evolved fro m a cold fractal distribution 
(see, iGoodwin fc Whitworthl l2004h ; we will discuss simulations 
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Figure 3. The distribution of stars in a cluster evolved from cold, 
fractal initial conditions. The triangles show the positions of the 
five most massive stars, the squares the positions of the 6th to 
12th most massive stars. 



triangles show the positions of the 5 most massive stars and 
the squares the positions of the 6th to 12th most massive 
stars (the reason for these choices will be explained below). 

Interestingly, the cluster has collapsed into a config- 
uration in which the 5 most massive stars form a dense 
Trapezium-like cluster to one side, and the 6th to 12th most 
massive stars have formed a separate 'clump' on the other 
side. For this reason the cluster has no well-defined centre 
which serves to illustrate the strength of the MST method 
in dealing with unusual and clumpy clusters. 

In Fig. [4] we show the change of the 2D Amsr with 
Nmst for the cluster illustrated in Fig. [3] Fig. U shows that 
in this cluster there are 3 'levels' of mass segregation. Firstly, 
there is the presence of a Trapezium-like system formed 
from the five most massive stars (with an Amsr of around 
12). Secondly, there is another system, containing the 6th to 
12th most massive stars, which shows significantly less mass 
segregation than the Trapezium-like system (an Amsh of 
around 4). Thirdly, the 50 most massive stars in the clus- 
ter show a steadily decreasing degree of mass segregation. 
Beyond the 50th most massive star the cluster shows no 
evidence for mass segregation at all. 

We note that the second clump was not obvious in our 
initial 'eyeballing' of the simulation. It only became obvious 
when we plotted the positions of the 6th to 12th most mas- 
sive stars after the MST method alerted us to there being 
something special about these stars. 
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Figure 4. The evolution of the 2D A^msr with iVjyiST f° r the 
cluster illustrated in fig. [3] The dashed line indicates an Amsh 
of unity, i.e. no mass segregation. 
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Figure 5. The evolution of Amsr with AfjyiST for the ONC. The 
dashed line indicates an Amsr of unity, i.e. no mass segregation. 



4.4 The Orion Nebula Cluster 

Finally, we apply the method to real data, specific a lly th e 
Orion Nebula Cluster (ONC) data of iHillenbrandl l| 19971 ). 
We use the 900 stars for which IHillenbrandl (|l997t ) provide 



like this in detail in a later paper, for now the cluster is merely 
used as an illustration of the method. 



masses. We note that this is not an ideal dataset as many 
(presumably low-mass) stars lack masses. However it serves 

to il lustrate the method. 

iHillenbrand fc Hartmannl (|l998l ) show, using cumula- 
tive distribution functions and mean stellar mass as a func- 
tion of radius, that the ONC is mass segregated. They find 
evidence for mass segregation in the ONC for stars more 
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massive than 5 Mq within 1 pc, with less compelling evi- 
dence beyond this radius. 

Figure [5] shows the change of Amsr with TVmst for 
the ONC. The MST method clearly picks out the mass 
segregation of the Trapezium system at A^mst = 4 with 
Amsh = 8.0 ± 3.5. However, the MST method also shows 
that there appears to be a secondary level of mass segrega- 
tion involving the 9 most massive stars (> 5 Mp) in the ONC 
in agreement with Iffillenbrand fc Hartmannl ij 19981 ). Whilst 
the significance of the Amsr is low (around 2.0 ±0.5), there 
is a clear trend in that each of the 5th to 9th stars show 
the same increased level of mass seg regation. However, un- 
like |mIe^randAHartman3 (jl998h we find no evidence of 
mass segregatation below 5 Mq. 



4- Repeat the above steps for different values of Nmst 
to determine at what masses the cluster is segregated, and 
to what degree at each mass. 

By examining the difference between the MST of a sub- 
set of (massive) stars and an equal number of random stars it 
is possible to quantify the level of (inverse-) mass segregation 
in a cluster. Tests on artificial clusters show that the method 
behaves as expected and can identify mass segregation. 

We apply the method to the Orion Nebula Cluster 
(ONC) and find that the Trapezium is mass segregated with 
an Amsr of 8.0±3.5. We also find that the 5th to 12th most 
massive stars down to around 5 Moalso show evidence of 
being mass segregated with an Aa/sh of 2.0 ± 0.5. Below 5 
M0we find no evidence for mass segregation. 



5 FURTHER APPLICATIONS AND FUTURE 
WORK 

In future papers we will apply our new method to more 
observational and theoretical data to look for mass segre- 
gation, and also to examine if low-mass stars have different 
distributions to brown dwarfs and/or high mass stars. 

Binaries. In our tests we have not included binary stars, 
and they will have an effect on the lengths of MSTs. If a 
binary is resolved, then it is very likely that the two com- 
ponents will be linked as a node (if this is not the case it 
would be unclear that the system really was a binary). In- 
deed, triples and quadruples will also be linked as a subset 
within the MST. This raises the possibility that MSTs could 
be very useful in locating binary and multiple systems by 
looking for short links within the MST. However, as noted 
by Cartwright & Whitworth (2005) care must be taken in 
dealing with binaries. 

Incompleteness. In many clusters there is significant in- 
completeness, especially in regions around the most massive 
stars where low-mass stars cannot be observed (if they are 
present). This presents a significant problem as it is impossi- 
ble to know if there are many low-mass stars in the 'central' 
regions. If there are, then an average MST length would 
be short. If not, it would be long (this is exactly the same 
problem faced by other methods) . We will explore this prob- 
lem in a future paper by creating and analysing synthetic 
observational datasets. 



6 SUMMARY 

We have outlined a new method of determining and quan- 
tifying mass segregation in a cluster by using a minimum 
spanning tree (MST) . The algorithm proceeds in the follow- 
ing way: 

1. Determine the length of the MST of the Nmst most 
massive stars; Z maS sivc 

2. Determine the average length of the MST of sets of 
Nmst random stars; (Znorm) 

3. Determine with what statistical significance /massive 
differs from (Z norm )- 

A (^norm) , ^iiorm 

Amsr = ± -, 

'massive 'massive 
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